Using Multiple Imputation Technique to Correct for Measurement Error and Statistical Disclosure Control in Sensitive Count Data in a National Survey
نویسنده
چکیده
Measurement error in sensitive question is pervasive, therefore, biasing the estimation of most statistical models. The objective of this paper is to correct for measurement error in the number of life-time sexual partners by treating it as a missing data problem and using multiple imputation technique to synthesize this underlying true attribute. Bayesian Poisson model with diffuse Gaussian priors was fitted to the 1996 General Social Survey combining knowledge of data quality from the mode experiment conducted by Tourangeau and Smith (1996). Ignored in existing literature, the threat of augmented disclosure harm from releasing both imputed and original data to the public was recognized and tackled by statistical perturbation. Bias reduction and statistical integrity were evaluated. Markov Chain Monte Carlo algorithm was programmed using WinBUGS.
منابع مشابه
Nonresponse prediction in an establishment survey using combination of statistical learning methods
Nonrespose is a source of error in the survey results and national statistical organizations are always looking for ways to control and reduce it. Predicting nonrespons sampling units in the survey before conducting the survey is one of the solutions that can help a lot in reducing and treating the survey nonresponse. Recent advances in technology and the facilitation of complex calculations...
متن کاملChapter 8 Multiple Imputation and Disclosure Protection : TheCase of the 1995 Survey of Consumer Finances Arthur
Donald Rubin has suggested many times that one might multiply impute all the data in a survey as means of avoiding disclosure problems in public-use datasets. Disclosure protection in the Survey of Consumer Finances is a key issue driven by two forces. First, there are legal requirements stemming from the use of tax data in the sample design. Second, there is an ethical responsibility to protec...
متن کاملMultiple imputation: an alternative to top coding for statistical disclosure control
Top coding of extreme values of variables like income is a common method of statistical disclosure control, but it creates problems for the data analyst. The paper proposes two alternative methods to top coding for statistical disclosure control that are based on multiple imputation. We show in simulation studies that the multiple-imputation methods provide better inferences of the publicly rel...
متن کاملFeasibility of using statistical tests in evaluation of non-uniformity [Persian]
Introduction: Non-uniformity test is essentially the only required daily QC procedure in nuclear medicine practice. Noise creates statistical variation or random error in a flood image. Non-uniformity on the other hand does not have statistical nature and may be regarded as systemic error. The present methods of non-uniformity calculation do not distinguish between these two types of erro...
متن کاملCombining synthetic data with subsampling to create public use microdata files for large scale surveys
To create public use files from large scale surveys, statistical agencies sometimes release random subsamples of the original records. Random subsampling reduces file sizes for secondary data analysts and reduces risks of unintended disclosures of survey participants’ confidential information. However, subsampling does not eliminate risks, so that alteration of the data is needed before dissemi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007